Abstract

The start of the LHC has motivated an effort to determine the relative probability of
the different regions of the MSSM parameter space, taking into account the present
theoretical and experimental wisdom about the model. Since the present experimental
data are not powerful enough to select a small region of the MSSM parameter space,
the choice of a judicious prior probability for the parameters becomes most relevant.
Previous studies have proposed theoretical priors that incorporate some (conventional)
measure of the fine-tuning, to penalize unnatural possibilities. However, we show that
such penalization arises from the Bayesian analysis itself (with no ad hoc assumptions),
upon the marginalization of the $\mu$-parameter. Furthermore, the resulting effective prior
contains precisely the Barbieri-Giudice measure, which is very satisfactory. On the other
hand, we carry out a rigorous treatment of the Yukawa couplings, showing in particular
that the usual practice of taking the Yukawas "as required" approximately corresponds
to taking logarithmically flat priors in the Yukawa couplings. Finally, we use an efficient
set of variables to scan the MSSM parameter space, trading in particular $B$ for $\tan\beta$,
and giving the effective prior in the new parameters. Besides the numerical results, we give
accurate analytic expressions for the effective priors in all cases. Whatever experimental
information one may use in the future, it is to be weighted by the Bayesian factors worked
out here.
1 Introduction



The imminent start of the LHC has motivated an interesting effort (see refs. [1-8]) to
anticipate which kind of supersymmetric model is more likely to be there or, in more precise
words, which region of the parameter space of the minimal supersymmetric standard
model (MSSM) is more probable, taking into account the present (theoretical and
experimental) wisdom about the model. This wisdom includes theoretical constraints (and
perhaps prejudices) and experimental constraints, such as electroweak precision tests.
The idea is to use this information to determine the relative probability of the different
regions of the MSSM parameter space, hence the frequent expression "LHC forecasts".
The appropriate framework to evaluate this probability is the Bayesian approach, which
allows one to separate neatly the objective and subjective pieces of information.

In the Bayesian analysis one tries to make inferences about the relative probability of
different "states of nature" (corresponding to different values of the parameters defining
the model, say $p_i$) upon the observation of different data which are determined$^4$ completely



The probability density of a particular point in the parameter space, given a
certain set of data, is the so-called posterior probability density function (pdf),
$p(p_i^0|\mathrm{data})$, which is given by the fundamental Bayesian relation (for a review see ref. [9])

$$ p(p_i^0|\mathrm{data}) = \frac{p(\mathrm{data}|p_i^0)\ p(p_i^0)}{p(\mathrm{data})} . \tag{1.1} $$

Here $p(\mathrm{data}|p_i^0)$ is the likelihood (sometimes denoted by $\mathcal{L}$), i.e. the probability density of
measuring the given data for the chosen point in the parameter space. E.g. for observables
measured within a Gaussian uncertainty, $\mathcal{L}$ is proportional to $e^{-\frac{1}{2}\chi^2}$, where $\chi^2$ is the
conventional chi-squared. $p(p_i^0)$ is the prior, i.e. the "theoretical" probability density that
we assign a priori to the point in the parameter space. Finally, $p(\mathrm{data})$ is a normalization
factor which plays no role unless one wishes to compare different classes of models, so for
the moment it can be dropped from the previous formula.
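
A minimal numerical sketch of the Bayesian relation (not taken from the original analysis): the snippet below evaluates the posterior on a grid for a single hypothetical parameter `p` with a toy Gaussian likelihood, once with a flat prior and once with a logarithmically flat one. All names and numbers (the grid, the "measured" value 5.0 and the two uncertainties) are illustrative assumptions.

```python
import math

def gaussian_likelihood(p, measured, sigma):
    """Toy likelihood exp(-chi^2 / 2) for one observable whose prediction is p itself."""
    chi2 = ((p - measured) / sigma) ** 2
    return math.exp(-0.5 * chi2)

def flat_prior(p):
    return 1.0

def log_flat_prior(p):
    return 1.0 / p  # flat in log(p)

def posterior_mean(grid, prior, measured, sigma):
    """Mean of p under posterior ~ likelihood * prior; p(data) cancels in the ratio."""
    weights = [gaussian_likelihood(p, measured, sigma) * prior(p) for p in grid]
    return sum(p * w for p, w in zip(grid, weights)) / sum(weights)

# Hypothetical scan range for the parameter: 0.1, 0.2, ..., 20.0
GRID = [0.1 * k for k in range(1, 201)]

# Sharp data (sigma = 0.2): the prior hardly matters.
sharp_flat = posterior_mean(GRID, flat_prior, 5.0, 0.2)
sharp_log = posterior_mean(GRID, log_flat_prior, 5.0, 0.2)

# Weak data (sigma = 3.0): the prior visibly shifts the result.
weak_flat = posterior_mean(GRID, flat_prior, 5.0, 3.0)
weak_log = posterior_mean(GRID, log_flat_prior, 5.0, 3.0)

print(f"sharp data: flat {sharp_flat:.3f}, log-flat {sharp_log:.3f}")
print(f"weak data:  flat {weak_flat:.3f}, log-flat {weak_log:.3f}")
```

In this toy setup the two posterior means nearly coincide when the likelihood is sharp and differ substantially when it is broad, which is why the choice of prior matters whenever the data are weak.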

One can say that in eq. (1.1) the first factor (the likelihood) is objective, while the
second (the prior) is subjective, since it contains our prejudices about which regions of
the parameter space are more "natural" or "expectable". It is desirable that the results
of the analysis be as independent as possible of the chosen prior. This happens if the
data are powerful enough to select a very small region of the parameter space, so that
eq. (1.1) is dominated by the likelihood, i.e. the pdf is essentially non-zero just in the
narrow region of non-vanishing $p(\mathrm{data}|p_i)$. However, in many instances this is not the
case, as happens for the MSSM.

4 Normally this determination takes the form of a probability distribution since the theoretical
computations and the experimental data are affected by different kinds of errors and uncertainties.

The somewhat subjective character of the prior, $p(p_i^0)$, has often motivated ignoring its
presence, identifying in practice $p(p_i^0|\mathrm{data})$ with $p(\mathrm{data}|p_i^0)$. However, it must be noticed
that this procedure implicitly amounts to a choice for the prior, namely a completely flat prior
in the parameters. This is not necessarily the most reasonable or "free of prejudices"
attitude. Note for example that using $p_i^2$ as initial parameters instead of $p_i$ the previous
flat prior becomes non-flat. So one needs some theoretical basis to establish, at least, the
parameters whose prior can be reasonably taken as flat.
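
The reparametrization remark can be made explicit with a one-line change of variables. As a sketch, assume a single positive parameter $p_i$ with a flat prior and define $q_i \equiv p_i^2$ (a symbol introduced here only for illustration):

```latex
p(p_i) = \mathrm{const.}
\quad\Longrightarrow\quad
p(q_i) = p(p_i)\left|\frac{dp_i}{dq_i}\right| \propto \frac{1}{2\sqrt{q_i}} ,
\qquad q_i \equiv p_i^2 ,
```

so a prior that is flat in $p_i$ favours small values when expressed in terms of $q_i$.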

If we are interested in the most probable value of one (or several) of the initial
parameters, say $p_i$, $i = 1, \dots, N_1$, but not in the others, $p_i$, $i = N_1 + 1, \dots, N$, we have to
marginalize the latter, i.e. integrate in the parameter space:

$$ p(p_i,\ i = 1, \dots, N_1|\mathrm{data}) = \int dp_{N_1+1} \cdots dp_N\ p(p_i,\ i = 1, \dots, N|\mathrm{data}) . \tag{1.2} $$

This procedure is very useful and common for making predictions about the values of
particularly interesting parameters. It must be noticed that, in order to perform the
marginalization, we need an input for the prior functions and for the range of allowed values of
the parameters, which determines the range of the definite integration (1.2). A choice for
these ingredients is therefore inescapable in trying to make LHC forecasts.
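
The dependence of a marginalized pdf on the allowed range can be seen in a small numerical sketch. The two-parameter "posterior" below is a made-up Gaussian toy (not an MSSM quantity), and the integration limits are arbitrary assumptions; the point is only that the integral over the unwanted parameter changes when its allowed range changes.

```python
import math

def joint_posterior(p1, p2):
    """Toy unnormalized posterior: p2 is Gaussian-distributed around p1, p1 around 1."""
    return math.exp(-0.5 * ((p1 - 1.0) ** 2 + (p2 - p1) ** 2))

def marginalize_over_p2(p1, p2_min, p2_max, n_steps=2000):
    """Trapezoidal estimate of the integral of the joint posterior over p2."""
    h = (p2_max - p2_min) / n_steps
    total = 0.5 * (joint_posterior(p1, p2_min) + joint_posterior(p1, p2_max))
    for k in range(1, n_steps):
        total += joint_posterior(p1, p2_min + k * h)
    return total * h

# Same point p1 = 1, two different allowed ranges for the marginalized parameter.
wide = marginalize_over_p2(1.0, -10.0, 10.0)
narrow = marginalize_over_p2(1.0, 1.0, 10.0)
print(wide, narrow)
```

Restricting the range of `p2` to values above 1 halves the marginalized density at `p1 = 1` in this toy case, so the bounds are as much an input as the prior itself.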

Let us now particularize these general statements to the MSSM (for a review see
[10]). Besides the Standard Model (SM)-like parameters (to be discussed below), the
MSSM contains a great number of parameters associated with the unknown process of
supersymmetry (SUSY) breaking, the so-called soft SUSY-breaking terms. Assuming
universality of these terms at a given high scale (namely the scale at which the SUSY
breaking is transmitted to the observable sector), these parameters are reduced to four:
the universal scalar mass, $m$, the universal gaugino mass, $M$, the universal trilinear scalar
coupling, $A$, and the bilinear scalar coupling, $B$. The universality assumption is in part
justified by the need to keep the FCNC processes under control, and it does come out
naturally in several schemes of SUSY breaking mediation, e.g. minimal SUGRA or
gauge-mediated models (for a review see [11] and [12] respectively). Besides these four parameters
one has to include the $\mu$-parameter (i.e. the Higgs mass term in the superpotential) as
an additional independent parameter, presumably with a magnitude similar to the soft
breaking terms, as is demanded by a successful electroweak breaking (see below). The
notation used here is consistent with refs. [10,13].

The SM-like parameters of the MSSM include the $SU(3) \times SU(2) \times U(1)_Y$ gauge
couplings, $g_3$, $g$, $g'$, and the Yukawa couplings, which in turn determine the fermion masses
and mixing angles. An important difference from the SM is that the MSSM contains two
Higgs doublets, $H_1$, $H_2$, with expectation values $v_i = \langle H_i^0 \rangle$ determined by the parameters
of the model upon minimization of the scalar potential, $V(H_1, H_2)$. They have to fulfill
$2(v_1^2 + v_2^2) = v^2 = (246\ \mathrm{GeV})^2$. The down-type-quark masses go like $m_d \sim y_d v_1 = y_d v \cos\beta$,
where $\tan\beta \equiv v_2/v_1$. Similarly, for the up-type quarks $m_u \sim y_u v_2 = y_u v \sin\beta$, and for the
charged leptons $m_e \sim y_e v_1 = y_e v \cos\beta$. Hence the values of the Yukawa couplings which
give the observed fermion masses depend on the derived parameter $\tan\beta$, a fact that will
be relevant later in our discussion.

In sect. 2 we address some basic aspects of the Bayesian approach for the MSSM,
showing in particular that a penalization of the fine-tuning arises from the Bayesian
analysis itself (with no ad hoc assumptions as in previous analyses), upon the
marginalization of the $\mu$-parameter (subsect. 2.1). We also present a rigorous treatment of the
Yukawa couplings, showing that the usual practice of taking the Yukawas "as required"
approximately corresponds to taking logarithmically flat priors in the Yukawa couplings
(subsect. 2.2). In sect. 3 we use an efficient set of variables to scan the MSSM parameter
space, trading in particular $B$ for $\tan\beta$, and giving the effective prior in the new parameters.
Finally, in sect. 4 we summarize our results and conclusions.



2 Some basic aspects 

2.1 Connection between the Bayesian approach and the fine-tuning measure

It is common lore that the parameters of the MSSM, $\{m, M, A, B, \mu\}$, should not be far
from the electroweak scale in order to avoid unnatural fine-tunings to obtain the correct
scale of the electroweak breaking. This can be easily appreciated from the minimization of
the tree-level form of the scalar potential, $V(H_1, H_2)$, which gives the expectation values
of the Higgses, and thus the value of $M_Z^2 = \frac{1}{2}(g^2 + g'^2)(v_1^2 + v_2^2)$, namely

$$ M_Z^2 = 2\,\frac{m_{H_1}^2 - m_{H_2}^2 \tan^2\beta}{\tan^2\beta - 1} - 2\mu^2 . \tag{2.1} $$

Unless the $\mu$-term and the soft masses $m_{H_i}$ (which upon the renormalization running
depend also on the other soft terms) are close to the electroweak scale, a delicate cancellation
among the various terms in the right hand side of (2.1) is necessary to get the experimental
$M_Z$.

A conventional measure of the degree of fine-tuning is given by the Barbieri-Giudice
fine-tuning parameters [14]:

$$ c_i = \left| \frac{\partial \ln M_Z^2}{\partial \ln p_i} \right| , \tag{2.2} $$

which measure the sensitivity of $M_Z$ with respect to the parameters of the model, $p_i$.
The global measure of the fine-tuning is taken as $c = \max\{c_i\}$ or $c = \sqrt{\sum_i c_i^2}$ [14-17].

Previous studies have attempted to incorporate this fine-tuning measure into the Bayesian
approach through the prior $p(p_i)$. In particular, in refs. [2,18] a prior $p(p_i) \propto 1/c$ was
proposed$^5$. In principle this is not unreasonable, since $1/c$ approximately indicates the
probability of a cancellation among the various terms contributing to $M_Z^2$ to give a result
$\lesssim (M_Z^{\rm exp})^2$. This can be intuitively seen as follows. Expanding $M_Z^2(p_i)$ around a point
in parameter space that gives the desired cancellation, say $\mathcal{P}^0 = \{p_i^0\}$, up to the linear
term in the parameters, one finds that only a small neighborhood $\delta\mathcal{P} \sim \mathcal{P}^0/c$ around this
point gives a value of $M_Z^2$ smaller than or equal to the experimental value [15]. Hence, if one
assumes that $\mathcal{P}$ could reasonably have taken any value of the order of magnitude of $\mathcal{P}^0$,
then only for a small fraction $\sim 1/c$ of this region does one get $M_Z^2 \lesssim (M_Z^{\rm exp})^2$, hence the rough
probabilistic meaning of $c$.
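
The rough probabilistic meaning of the fine-tuning parameter can be checked numerically in a toy setting. The sketch below is not the MSSM computation: it assumes a simplified stand-in relation `M_Z^2 = 2 m_soft^2 - 2 mu^2` (the function `mz_squared` and the soft mass `m_soft` are hypothetical), evaluates the logarithmic sensitivity of $M_Z^2$ to $\mu$ by finite differences, and compares its inverse with the relative width of the $\mu$ interval that gives $0 \le M_Z^2 \le (M_Z^{\rm exp})^2$.

```python
import math

M_Z_EXP = 91.19  # GeV, experimental Z mass (rounded)

def mz_squared(mu, m_soft):
    """Toy stand-in for the tree-level relation: M_Z^2 = 2 m_soft^2 - 2 mu^2 (an assumption)."""
    return 2.0 * m_soft ** 2 - 2.0 * mu ** 2

def mu_zero(m_soft):
    """Value of mu that reproduces M_Z_EXP for the given soft mass."""
    return math.sqrt(m_soft ** 2 - 0.5 * M_Z_EXP ** 2)

def fine_tuning(m_soft, rel_step=1e-6):
    """|d ln M_Z^2 / d ln mu| at mu_zero, by a central finite difference.

    rel_step must stay well below the allowed relative width of mu.
    """
    mu0 = mu_zero(m_soft)
    up = math.log(mz_squared(mu0 * (1.0 + rel_step), m_soft))
    down = math.log(mz_squared(mu0 * (1.0 - rel_step), m_soft))
    return abs(up - down) / (math.log(1.0 + rel_step) - math.log(1.0 - rel_step))

def allowed_fraction(m_soft):
    """Width of the mu interval giving 0 <= M_Z^2 <= M_Z_EXP^2, in units of mu_zero."""
    mu0 = mu_zero(m_soft)
    return (m_soft - mu0) / mu0

for m_soft in (500.0, 1000.0, 2000.0):
    c = fine_tuning(m_soft)
    print(m_soft, c, allowed_fraction(m_soft) * c)
```

In this toy relation the product of the allowed fraction and the fine-tuning parameter approaches one as the soft mass grows, i.e. the allowed fraction of the $\mu$ range behaves like $1/c$.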



5 Another prior designed to capture the naturalness criterion has been proposed in ref. [4].

However, though reasonable, the above-mentioned proposals for priors are rather
arbitrary, as is the very measure of the fine-tuning. On the other hand, since naturalness
arguments are deep down statistical arguments, one might expect that an effective
penalization of fine-tunings should arise from the Bayesian analysis itself, with no need of
introducing "naturalness priors" ad hoc. This is in fact the case, as we are about to see.

Let us consider $M_Z$ as an experimental datum, on a similar footing to the rest of the physical
observables. Then the total likelihood reads

$$ p(\mathrm{data}|s, m, M, A, B, \mu) = N_Z\, e^{-\frac{1}{2}\chi_Z^2}\ \mathcal{L}_{\rm rest} , \tag{2.3} $$

where $s$ represents the SM-like parameters, $\mathcal{L}_{\rm rest}$ is the likelihood associated to all the
physical observables except $M_Z$, and

$$ \chi_Z^2 = \left( \frac{M_Z - M_Z^{\rm exp}}{\sigma_Z} \right)^2 , \tag{2.4} $$

where $\sigma_Z \ll M_Z^{\rm exp}$ is the experimental uncertainty in the $Z$ mass; finally $N_Z = 1/(\sqrt{2\pi}\,\sigma_Z)$
is a normalization constant. Let us now use this sharp dependence on $M_Z$ to marginalize
the pdf in the $\mu$-parameter, performing a change of variable $\mu \to M_Z$:



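$$ \begin{aligned}
p(s, m, M, A, B|\mathrm{data}) &= \int d\mu\ p(s, m, M, A, B, \mu|\mathrm{data}) \\
&= N_Z \int dM_Z \left[ \frac{d\mu}{dM_Z} \right] e^{-\frac{1}{2}\chi_Z^2}\ \mathcal{L}_{\rm rest}\ p(s, m, M, A, B, \mu) \\
&\simeq \mathcal{L}_{\rm rest} \left[ \frac{d\mu}{dM_Z} \right]_{\mu_0} p(s, m, M, A, B, \mu_0) ,
\end{aligned} \tag{2.5} $$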
where $\mu_0$ is the value of $\mu$ that reproduces the experimental value of $M_Z$ for the given
values of $\{s, m, M, A, B\}$. In the last line of (2.5) we have approximated
$N_Z\, e^{-\frac{1}{2}\chi_Z^2} \simeq \delta(M_Z - M_Z^{\rm exp})$. Essentially the same result is obtained by performing the
$\mu$-integration in the stationary point approximation. Now, comparing (2.5) to the definition of
the fine-tuning parameters (2.2), we can write



$$ p(s, m, M, A, B|\mathrm{data}) = 2\, \mathcal{L}_{\rm rest}\, \frac{\mu_0}{M_Z}\, \frac{1}{c_\mu}\ p(s, m, M, A, B, \mu_0) . \tag{2.6} $$
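
The appearance of the fine-tuning parameter in this result amounts to a one-line manipulation of the Jacobian $d\mu/dM_Z$. As a sketch, assuming the logarithmic-derivative form of the fine-tuning parameter for $\mu$ (written on the first line below):

```latex
c_\mu = \left|\frac{\partial \ln M_Z^2}{\partial \ln \mu}\right|
      = \frac{\mu}{M_Z^2}\left|\frac{\partial M_Z^2}{\partial \mu}\right|
      = \frac{2\mu}{M_Z}\left|\frac{d M_Z}{d \mu}\right|
\quad\Longrightarrow\quad
\left|\frac{d\mu}{dM_Z}\right|_{\mu_0} = 2\,\frac{\mu_0}{M_Z}\,\frac{1}{c_\mu} ,
```

which is the factor that multiplies the likelihood of the remaining observables and the prior evaluated at $\mu_0$.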



Several comments are in order here. First, the presence of the fine-tuning parameter, $c_\mu$,
penalizes the regions of the parameter space with large fine-tuning, as desired. Actually,
eq. (2.6) is very similar to multiplying by hand the initial prior in the parameters by a factor
$1/c$, as in ref. [2]. The difference is that here the factor $1/c_\mu$ has not been put in by hand:
it comes out of the marginalization in $\mu$. Moreover, the prior $p(s, m, M, A, B, \mu_0)$ is
still undefined. If one takes it as flat, then one gets the same as in ref. [2], but with
one factor $\mu$ in the numerator (still, the regions of large fine-tuning are penalized since $c_\mu$
goes parametrically as $\sim \mu^2$). If one takes logarithmically flat priors, i.e. $p(\mu) \propto 1/\mu$,
then eq. (2.6) would formally coincide with the procedure of multiplying the theoretical
prior $p(s, m, M, A, B)$ by a factor $1/c$. This is reasonable: the usual naturalness criteria
implicitly assume that for a given value of one parameter, say $\mu = \mu_0$, the prior probability
is distributed around $\mu_0$ [15,17] with a width $\sim \mu_0$ [see the brief discussion in the paragraph
after eq. (2.2)]. This is equivalent to assuming that the value $\mu = \mu_0$ has a prior probability
$\propto 1/\mu_0$. Actually this is the reason why, according to usual fine-tuning arguments, large
soft parameters are more unlikely than small ones: for the former the region of the
parameter space that produces the observed electroweak scale is much narrower than for
the latter, not in absolute value, but compared to the size of the soft parameters in each
case. Assuming flat priors there would be no reason to prefer soft parameters of the
electroweak size instead of e.g. order $M_{\rm GUT}$. The fact that even for flat priors we still get
a penalty factor $\mu/c_\mu$ comes from the assumption of a prior flat in $\mu$ instead of $\mu^2$, which
is the quantity that appears in the cancellation [see e.g. eq. (2.1)].

We find it very satisfactory that the usual parameter to quantify the degree of fine-tuning
emerges from the Bayesian approach "spontaneously", not upon subjective assumptions,
especially taking into account that there has been much discussion in the literature about
its significance and suitability, see e.g. refs. [15-19]. Actually, one gets simply $c_\mu$ instead of
$c$, as defined in eq. (2.2). Of course there is nothing special about the $\mu$-parameter, except
the fact that we have chosen to marginalize it using the experimental information about
$M_Z$, which is the usual practice. Had we chosen to marginalize another parameter, say
$M$, we would have got $c_M$, but of course in the end the results would be the same.

A convenient way to view eq. (2.6) is to imagine that we start with an MSSM parameter
space $\{s, m, M, A, B\}$ where $\mu$ has been eliminated using the experimental value of $M_Z$.
Then the pdf appears as the likelihood associated to the experimental information (except
$M_Z^{\rm exp}$) times an effective prior

$$ p_{\rm eff}(s, m, M, A, B) \equiv 2\, \frac{\mu_0}{M_Z}\, \frac{1}{c_\mu}\ p(s, m, M, A, B)\ p(\mu_0) , \tag{2.7} $$

where for simplicity we have assumed that the prior in $\mu$ factorizes from the rest. This
means that the initial prior gets multiplied by a factor $2\, \frac{\mu_0}{M_Z}\, \frac{1}{c_\mu}$ that carries the
fine-tuning penalty. In Fig. 1 we have plotted this factor in representative slices of the
$\{s, m, M, A, B\}$ parameter space (using the two basic choices $p(\mu) \propto$ const., $p(\mu) \propto 1/\mu$)
for some illustrative and physically relevant cases. In all of them large soft parameters get
penalized (except partially for focus-point regions [20,21]). There are no ad hoc
assumptions behind this result; it just comes out from the value of $M_Z^{\rm exp}$ and the marginalization of
$\mu$.

For practical calculations it is useful to have an approximate expression for $c_\mu$. From
the tree-level condition (2.1) we see that $c_\mu \sim 2\mu^2/M_Z^2$. Nevertheless, using the
approximate analytic formulas discussed in sect. 3, it is possible to write a much more refined
expression for $c_\mu$, which we postpone to that section.

2.2 Nuisance variables and the role of the Yukawa couplings 

It is common in statistical problems that not all the parameters that define the system are
of interest. In the problem at hand we are interested in determining the probability regions
for the MSSM parameters that describe the new physics, i.e. $\{m, M, A, B, \mu\}$, but not (or
not at the same level) in the SM-like parameters, denoted by $\{s\}$. However, the nuisance
parameters $\{s\}$ play an important role in extracting experimental consequences from the
MSSM. The usual technique to eliminate nuisance parameters is simply marginalizing
them, i.e. integrating the pdf (2.6) in the $\{s\}$ variables (for a review see ref. [22]).
When the value of a nuisance parameter is in one-to-one correspondence with a high-quality
experimental piece of information (included in $\mathcal{L}_{\rm rest}$), this integration simply selects the
"experimental" value of the nuisance parameter, which thus becomes (basically) a constant
with no further statistical significance in the analysis. In particular, the prior on such a
nuisance parameter becomes irrelevant. In the MSSM, nuisance parameters of this class
are the gauge couplings, $\{g_3, g, g'\}^6$, which thus can be extracted from the analysis.

Figure 1: Values of the factor $\mu_0/(M_Z c_\mu)$ (in logarithmic units and up to a convenient
proportionality constant) in the $\{m, M\}$ plane for $\mu > 0$, $A = 0$, $B = 0$ (upper plots), and
for $\mu < 0$, $A = 0$ and the minimal SUGRA relation $B = A - m$ (lower plots), using the
two basic initial priors, $p(\mu) \propto$ const. (left plots), $p(\mu) \propto 1/\mu$ (right plots). The plotted
factor appears in the effective prior given in eq. (2.7).

In the pure SM a similar argument can be used to eliminate the Yukawa couplings,
since they are in one-to-one correspondence with the quark and lepton masses. However, as
discussed in sect. 1, in the MSSM these masses depend also on the value of $\tan\beta = v_2/v_1$,
which is a derived quantity that takes different values at different points of the MSSM
parameter space. This means that two viable MSSM models (with the same fermion masses)
will have in general very different values of the Yukawa couplings, and thus the theoretical
prior, $p(y)$, will play a relevant and non-ignorable role in their relative probability. Any
Bayesian analysis of the MSSM amounts to an explicit or implicit assumption about the
prior in the Yukawa couplings.

6 Strictly speaking, the initial theoretical inputs are the gauge couplings at high energy, which are
related to the experimental (low-energy) ones by the renormalization-group running. This running depends
on the other MSSM parameters through the position of thresholds associated with different particles.
Hence, two viable MSSM models have slightly different values of the gauge couplings at high energy,
and thus the theoretical prior on the couplings would play an (almost insignificant) role in the statistical
comparison of the two models.

In order to make these points more explicit, let us temporarily simplify the discussion by
approximating the experimental likelihood related to the fermion masses as

$$ \mathcal{L}_{\rm fermion\ masses} = \delta(m_t - m_t^{\rm exp})\ \delta(m_b - m_b^{\rm exp}) \cdots \tag{2.8} $$

(which is a fair approximation). This is a factor of the global likelihood, $\mathcal{L}_{\rm rest}$. Likewise,
let us approximate the theoretical values of the fermion masses as

$$ m_t = \frac{1}{\sqrt{2}}\, y_t^{\rm low}\, v\, s_\beta , \qquad m_b = \frac{1}{\sqrt{2}}\, y_b^{\rm low}\, v\, c_\beta , \qquad \text{etc.} \tag{2.9} $$

where $s_\beta \equiv \sin\beta$, $c_\beta \equiv \cos\beta$ and $y_i^{\rm low}$ are the low-energy Yukawa couplings. As is well
known, these expressions correspond to the running masses. The physical (pole) masses
include a radiative correction that we have ignored here, but not in our full analysis.
A further simplification is to assume $y_i^{\rm low} = R_i y_i$, where $y_i$ are the high-energy Yukawa
couplings (and thus the input parameters) and the renormalization-group factor $R_i$ does
not depend on $y_i$ itself (this is not a good approximation for the top Yukawa coupling,
but we will assume it momentarily for the sake of clarity). Now, the marginalization in
the Yukawa couplings can be readily done, integrating the pdf given by eq. (2.6) in the $y_i$
variables. Writing just the relevant terms we get



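$$ \begin{aligned}
\int [dy_t\, dy_b \cdots]\ p(y, m, M, A, B|\mathrm{data}) &\sim \int [dy_t\, dy_b \cdots]\ p(y)\ \delta(m_t - m_t^{\rm exp})\ \delta(m_b - m_b^{\rm exp}) \cdots \\
&\sim p(y) \left| \frac{dy_t}{dm_t} \right| \left| \frac{dy_b}{dm_b} \right| \cdots \ \sim\ p(y)\ s_\beta^{-1}\, c_\beta^{-1} \cdots
\end{aligned} \tag{2.10} $$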



where $p(y)$ denotes the prior in the Yukawa couplings (which we assume factorizes
from the other priors). Eq. (2.10) represents the footprint of the Yukawa couplings in the
pdf. Note that the factors $s_\beta^{-1} c_\beta^{-1} \cdots$ arise from the change of variables $y_i \to m_i$, even if
the likelihood is not approximated by deltas. There are as many such factors as quarks
and leptons. This amounts to a dramatic modulation of the relative probability of MSSM
regions with different $\tan\beta$ if one chooses a flat prior, $p(y) =$ const. If, instead, one takes
logarithmically flat priors, i.e. $p(y_i) \propto 1/y_i$, then the $s_\beta^{-1} c_\beta^{-1} \cdots$ factors get cancelled, so
that the elimination of the Yukawa couplings does not leave a footprint in the probability
density of the (non-nuisance) MSSM parameter space, $\{m, M, A, B, \mu\}$.
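
The cancellation described above can be illustrated numerically. The sketch below assumes the simplified tree-level relation between mass and Yukawa coupling with a constant renormalization-group factor (here set to 1 by default); the function names and the bottom-quark mass value are illustrative assumptions, not part of the original analysis. It compares the factor $p(y)\,|dy/dm|$ at two values of $\tan\beta$ for a flat and for a logarithmically flat prior.

```python
import math

V_EW = 246.0  # GeV, electroweak vev normalization used in the text

def angle_factor(tan_beta, up_type):
    """sin(beta) for up-type fermions, cos(beta) for down-type ones."""
    beta = math.atan(tan_beta)
    return math.sin(beta) if up_type else math.cos(beta)

def yukawa(mass, tan_beta, up_type, rg_factor=1.0):
    """High-energy Yukawa y with mass = rg_factor * y * v * angle / sqrt(2)."""
    return math.sqrt(2.0) * mass / (rg_factor * V_EW * angle_factor(tan_beta, up_type))

def dy_dm(tan_beta, up_type, rg_factor=1.0):
    """Jacobian |dy/dm| of the change of variables y -> m at fixed tan(beta)."""
    return math.sqrt(2.0) / (rg_factor * V_EW * angle_factor(tan_beta, up_type))

def footprint(mass, tan_beta, up_type, prior):
    """Factor p(y) |dy/dm| left in the pdf after integrating out one Yukawa."""
    return prior(yukawa(mass, tan_beta, up_type)) * dy_dm(tan_beta, up_type)

def flat_prior(y):
    return 1.0

def log_flat_prior(y):
    return 1.0 / y

M_BOTTOM = 4.2  # GeV, illustrative value only

# Relative weight of tan(beta) = 50 versus tan(beta) = 5 for a down-type fermion.
flat_ratio = (footprint(M_BOTTOM, 50.0, False, flat_prior)
              / footprint(M_BOTTOM, 5.0, False, flat_prior))
log_ratio = (footprint(M_BOTTOM, 50.0, False, log_flat_prior)
             / footprint(M_BOTTOM, 5.0, False, log_flat_prior))
print(flat_ratio, log_ratio)
```

With the flat prior the large-$\tan\beta$ point is enhanced by the ratio of the two $1/\cos\beta$ factors, whereas with the logarithmically flat prior the footprint reduces to $1/m$ and the ratio is one.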

In previous Bayesian analyses of the MSSM the role of the Yukawa couplings was not
considered to this extent. Essentially, their values were taken as needed to reproduce the
experimental fermion masses, within uncertainties. As we have seen, this practice
approximately corresponds to assuming logarithmically flat priors in the Yukawa couplings$^7$.

The above discussion is, however, oversimplified. As already mentioned, the
marginalization in the top Yukawa coupling (and sometimes the bottom one) produces extra factors
due to the dependence of $R_t$ on $y_t$. Actually, since one is marginalizing simultaneously
in the Yukawa couplings and the $\mu$-parameter, one has to evaluate the full Jacobian
of the transformation $\{\mu, y_t\} \to \{M_Z, m_t\}$, which introduces additional contributions.
Furthermore, the picture gets more complicated due to the fact that, for a given choice
of $\{m, M, A, B\}$, there may be several values of $\mu$ leading to the correct value of $M_Z$
with different values of $\tan\beta$ and thus of the Yukawa couplings. This means that in the
marginalization one has to sum over all these possibilities. This is technically annoying
and reduces the clarity of the approach. These drawbacks can be eliminated by trading,
in the statistical analysis, the initial $B$-parameter for the derived $\tan\beta$ parameter, as we
discuss in the next section.

Let us finally mention that in the analysis of ref. [2] the fermion masses themselves,
rather than the Yukawa couplings, were taken as SM-like variables. The advantage of such a
procedure is that these nuisance variables are in obvious one-to-one correspondence with the
experimental data. Then the priors on the masses become almost irrelevant, and they can
be integrated out, almost without leaving any footprint. However, this has two problems.
First, the fermion masses are obviously derived quantities and should not be taken as
initial input variables, even if this makes life easier. Second, such a procedure introduces
completely artificial factors, as will become clear at the end of the next section.



7 Actually, for independent reasons, we find the logarithmically flat prior for Yukawa couplings a most
sensible choice. Certainly there is no convincing origin for the experimental pattern of fermion masses,
and thus of Yukawa couplings. However, it is a fact that these come in very assorted orders of magnitude
(from $O(10^{-6})$ for the electron to $O(1)$ for the top), suggesting that the underlying mechanism may
produce Yukawa couplings of different orders with similar efficiency.
3 Efficient variables to scan the MSSM parameter space

In MSSM analyses it is normally very advantageous, both for theoretical and
phenomenological reasons, to trade the initial $B$-parameter for the derived $\tan\beta$ parameter. On the
phenomenological side, $\tan\beta$ is a parameter that appears explicitly in the predictions for
many physical processes, such as cross sections, branching ratios, etc. (unlike $B$,
which enters only in a very indirect way). Thus it is convenient to get the probability density
of the MSSM parameter space as a function of $\tan\beta$. On the theoretical side, for a given
viable choice of $\{m, M, A, \tan\beta\}$, there are exactly two values of $\mu$ (with opposite sign and
the same absolute value at low energy) leading to the correct value of $M_Z$. Thus, working
in one of the two (positive and negative) branches of $\mu$, each point in the $\{m, M, A, \tan\beta\}$
space corresponds to exactly one model, whereas a point in the $\{m, M, A, B\}$ space may
correspond to several models, introducing a conceptual and technical complication in the
analysis, as mentioned in the previous section.

Changing variables $B \to \tan\beta$ amounts to a factor $dB/d\tan\beta$ in the pdf. On the
other hand, we have seen in sect. 2 that it is convenient to trade $\mu$ and $y_t$ for $M_Z$ and $m_t$,
as this makes the marginalization of these variables easier and more transparent. Thus
we should compute the whole Jacobian, $J$, of the transformation

$$ \{\mu, y_t, B\} \to \{M_Z, m_t, t\} , \qquad t \equiv \tan\beta , \tag{3.1} $$

so that, in the new variables, the pdf reads

$$ p(g_i, m_t, m, M, A, \tan\beta|\mathrm{data}) = \mathcal{L}_{\rm rest}\ J|_{\mu=\mu_0}\ p(g_i, y_t, m, M, A, B, \mu = \mu_0) . \tag{3.2} $$

Here we have made explicit the dependence on the gauge couplings, and the top Yukawa
coupling and mass, but not on those of the other fermions. In this equation we have already
marginalized $M_Z$ using the associated likelihood $\sim \delta(M_Z - M_Z^{\rm exp})$ (recall that $\mu_0$ is the
value of $\mu$ that reproduces the experimental $M_Z$). The combination

$$ p_{\rm eff}(g_i, m_t, m, M, A, \tan\beta) \equiv J|_{\mu=\mu_0}\ p(g_i, y_t, m, M, A, B, \mu = \mu_0) \tag{3.3} $$

can be viewed as the effective prior in the new, more convenient, variables to scan the
MSSM. Note that, as discussed in subsect. 2.2, the gauge couplings are fairly irrelevant for
the statistical analysis, so we will drop them in what follows. In order to work out $J$ we
need the dependence of the old variables on the new ones, which can be derived from the
minimization equations of the scalar potential, $V(H_1, H_2)$, and from the expression of the
top pole mass. For the numerical analysis we have used the SOFTSUSY code [13], which
implements the full one-loop contributions and leading two-loop terms to the tadpoles
for the electroweak symmetry breaking conditions, with parameters running at two loops.
This essentially corresponds to the next-to-leading log approximation. However, in order
to highlight the most relevant facts it is useful to write down the expressions arising from
the minimization of the tree-level potential with parameters running at one loop (i.e.
essentially the leading log approximation):

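$$ \mu_{\rm low}^2 = \frac{m_{H_1}^2 - m_{H_2}^2\, t^2}{t^2 - 1} - \frac{M_Z^2}{2} , \tag{3.4} $$

$$ B_{\rm low} = \frac{\sin 2\beta}{2\mu_{\rm low}} \left( m_{H_1}^2 + m_{H_2}^2 + 2\mu_{\rm low}^2 \right) , \tag{3.5} $$

$$ y_{\rm low} = \frac{\sqrt{2}\, m_t}{v\, s_\beta} . \tag{3.6} $$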



Here the "low" subscript indicates that the quantity is evaluated at low scale (more
precisely, at a representative supersymmetric mass, such as the geometric average of
the stop masses). The soft masses $m_{H_i}^2$ are also understood at low scale. For notational
simplicity, we have dropped the subscript $t$ from the Yukawa coupling. We are not making
explicit the role of the bottom Yukawa coupling, which is treated on a similar footing to the
top one. Note that all these low-energy quantities contain an implicit dependence on the
top Yukawa coupling through the corresponding renormalization-group equations (RGEs).
The effect of the one-loop corrections to the effective potential on the previous expressions
is incorporated by correcting the soft masses $m_{H_i}^2$ with one-loop tadpole effects along the
lines of ref. [23]. Similarly, the pole top mass is given by the running top mass, appearing
in eq. (3.6), plus a radiative correction $\Delta_{\rm rad} m_t$. Eqs. (3.4)-(3.6), even when corrected with
the mentioned radiative effects, have the structure

$$ \mu = f(M_Z, y, t) , \qquad y = g(M_Z, m_t, t) , \qquad B = h(\mu, y, t) , \tag{3.7} $$

where we only make explicit the dependence on the variables involved in the change of
variables (3.1). Note that $y$ depends on $M_Z$ since $v \propto M_Z$. Notice also that, unlike
eqs. (3.4)-(3.6), eqs. (3.7) are defined in terms of the high-energy parameters.

From eqs. (3.7) it is straightforward to evaluate the Jacobian $J$ of the transformation
(3.1), and thus the effective prior (3.3). $J$ is simply



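$$ J = \begin{vmatrix}
\frac{\partial \mu}{\partial M_Z} & \frac{\partial \mu}{\partial t} & \frac{\partial \mu}{\partial m_t} \\
\frac{\partial B}{\partial M_Z} & \frac{\partial B}{\partial t} & \frac{\partial B}{\partial m_t} \\
\frac{\partial y}{\partial M_Z} & \frac{\partial y}{\partial t} & \frac{\partial y}{\partial m_t}
\end{vmatrix}
= \frac{\partial f}{\partial M_Z}\, \frac{\partial g}{\partial m_t}\, \frac{\partial h}{\partial t} , \tag{3.8} $$

where the factor $\partial f/\partial M_Z$ carries essentially the fine-tuning penalization discussed in
subsect. 2.1.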
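
The factorization of the determinant into three partial derivatives follows from the nested structure $\mu = f(M_Z, y, t)$, $y = g(M_Z, m_t, t)$, $B = h(\mu, y, t)$, and can be checked numerically. The sketch below uses made-up smooth functions `f`, `g`, `h` with that dependence structure (they are toy assumptions, not the MSSM expressions) and compares the full $3 \times 3$ determinant, computed by finite differences, with the product of the three partial derivatives.

```python
import math

A_SOFT = 500.0  # GeV, toy soft-mass scale (assumption)

def g(mz, mt, t):
    """Toy y = g(M_Z, m_t, t): Yukawa from the top mass, with v proportional to M_Z."""
    s_beta = t / math.sqrt(1.0 + t * t)
    return mt / (mz * s_beta)

def f(mz, y, t):
    """Toy mu = f(M_Z, y, t), loosely modelled on a minimization condition."""
    return math.sqrt(A_SOFT ** 2 * (1.0 + y * y) * t * t / (t * t - 1.0) - 0.5 * mz * mz)

def h(mu, y, t):
    """Toy B = h(mu, y, t)."""
    return 2.0 * t / (1.0 + t * t) * (A_SOFT ** 2 + mu * mu) / mu - 0.3 * A_SOFT * y

def old_variables(mz, t, mt):
    """Map the new variables (M_Z, t, m_t) to the old ones (mu, B, y)."""
    y = g(mz, mt, t)
    mu = f(mz, y, t)
    return (mu, h(mu, y, t), y)

def partial(func, args, index, rel_step=1e-5):
    """Central finite-difference derivative of func with respect to args[index]."""
    step = abs(args[index]) * rel_step
    up, down = list(args), list(args)
    up[index] += step
    down[index] -= step
    return (func(*up) - func(*down)) / (2.0 * step)

def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def full_jacobian(mz, t, mt):
    """Determinant of d(mu, B, y) / d(M_Z, t, m_t)."""
    point = (mz, t, mt)
    rows = [[partial(lambda *a, k=k: old_variables(*a)[k], point, j) for j in range(3)]
            for k in range(3)]
    return det3(rows)

def factorized_jacobian(mz, t, mt):
    """Product (df/dM_Z) (dg/dm_t) (dh/dt), each partial at fixed other arguments."""
    y = g(mz, mt, t)
    mu = f(mz, y, t)
    return (partial(f, (mz, y, t), 0) * partial(g, (mz, mt, t), 1)
            * partial(h, (mu, y, t), 2))

full = full_jacobian(91.19, 10.0, 173.0)
product = factorized_jacobian(91.19, 10.0, 173.0)
print(full, product)
```

The two numbers agree up to finite-difference error: the off-diagonal entries of the matrix are non-zero, but they drop out of the determinant because of the nested dependence of the three functions.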

We can give an analytical and quite accurate expression of $J$ by using the approximate
equations (3.4)-(3.6) and expressing the low-energy values of $\mu$, $B$, $y$ in terms of the
high-energy ones through the integrated 1-loop RGEs. Schematically,

$$ \mu_{\rm low} = R_\mu(y)\, \mu , \qquad B_{\rm low} = B + \Delta_{\rm RG} B(y) , \tag{3.9} $$

where $R_\mu(y)$, $\Delta_{\rm RG} B(y)$ are definite functions of $y$ (and other parameters, but not $\mu$ and
$B$) [24]. Similarly,

$$ y_{\rm low} \equiv y(Q_{\rm low}) = \frac{y\, E(Q_{\rm low})}{1 + 6\, y\, F(Q_{\rm low})} , \tag{3.10} $$

where $Q$ is the renormalization scale, $F = \int E\, d\ln Q$ (integrated between the high scale and
$Q_{\rm low}$), and $E(Q)$ is a definite function
that depends just on the gauge couplings [25]. Plugging (3.9) and (3.10) into eqs. (3.4)-(3.6)
we get explicit expressions for the $f$, $g$, $h$ functions. The relevant derivatives, to be
plugged into (3.8), read

$$ \frac{\partial f}{\partial M_Z} = -\frac{M_Z}{\mu}\, \frac{1}{2R_\mu^2} = -\frac{M_Z}{\mu_{\rm low}}\, \frac{1}{2R_\mu} , \tag{3.11} $$

$$ \frac{\partial h}{\partial t} = \frac{1 - t^2}{t(1 + t^2)}\, B_{\rm low} , \tag{3.12} $$

$$ \frac{\partial g}{\partial m_t} = \frac{\sqrt{2}\, E}{v\, s_\beta} \left( \frac{y}{y_{\rm low}} \right)^2 . \tag{3.13} $$

Let us comment briefly on these expressions. As mentioned above, eq. (3.11) is essentially
the fine-tuning factor $2\mu/(M_Z c_\mu)$ obtained in subsect. 2.1 [eq. (2.6)]. It penalizes large
scales for $\mu$. Eq. (3.12) counts the volume conversion from $dB$ to $dt$ and it is proportional
to a soft mass just for dimensional reasons. Note that this factor penalizes low scales.
This is easy to understand looking at eq. (3.5): for a given interval in $\tan\beta$, the larger
the values of the soft masses and $\mu$, the larger the corresponding interval in $B$ is. So
larger $B$ is favoured. Note, however, that the size of the interval of $B$ relative to the
value of $B$ itself (which is statistically meaningful) is essentially constant. Indeed, the
$B$-factor in eq. (3.12) will be cancelled in the pdf if one uses logarithmically flat priors for
the soft terms, $p(B) \propto 1/B$. This reasoning is similar to that after eq. (2.6). Finally,
eq. (3.13) corresponds to eq. (2.10) of our preliminary discussion. In particular, the $1/s_\beta$
factor corresponds to the same factor in (2.10).

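The Jacobian of the transformation (3.1) is given by the product of the three factors
of eqs. (3.11)-(3.13):

$$ J = \frac{1}{2\sqrt{2}} \left( g^2 + g'^2 \right)^{1/2} \left[ \frac{E}{R_\mu^2} \right] \frac{B_{\rm low}}{\mu}\, \frac{t^2 - 1}{t(1 + t^2)} \left( \frac{y}{y_{\rm low}} \right)^2 s_\beta^{-1} . \tag{3.14} $$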



In the previous derivation we have considered just the top Yukawa coupling in the
change of variables (3.1). Once the other fermions are taken into account, the Jacobian
gets an $s_\beta^{-1}$ factor for each $u$-type quark and a $c_\beta^{-1}$ factor for each $d$-type quark and
charged lepton, as discussed in subsect. 2.2. Now, recall that the effective prior in the
new variables is the product of $J$ by the initial prior, as expressed in eqs. (3.2, 3.3). So,
taking a logarithmically flat prior for the Yukawa couplings [i.e. $p(y_i) \propto y_i^{-1}$], the $s_\beta^{-1}$, $c_\beta^{-1}$
factors get cancelled in the effective prior and the pdf, since at fixed fermion mass the
low-energy Yukawa coupling itself scales as $s_\beta^{-1}$ (or $c_\beta^{-1}$). For the top Yukawa coupling
(and sometimes for the bottom one) this cancellation still leaves a residual dependence,
$s_\beta^{-1}\,(y/y_{\rm low})^2 \times y^{-1} \propto y/y_{\rm low}$, which through (3.10) depends on $y$ itself and thus
on $\tan\beta$.

Therefore, the effective prior defined by eq. (3.3) takes the approximate form

$$p_{\rm eff}(m_t, m, M, A, \tan\beta) \propto \left[\frac{E}{R_\mu^2}\right]\frac{y}{y_{\rm low}}\,\frac{t^2-1}{t(1+t^2)}\,\frac{B_{\rm low}}{\mu_0}\;p(m, M, A, B, \mu=\mu_0). \tag{3.15}$$



The most basic priors for the initial variables are the flat and the logarithmic ones, i.e.

$$p(m, M, A, B, \mu) = {\rm const.}, \qquad p(m, M, A, B, \mu) \propto \frac{1}{m\,M\,A\,B\,\mu}. \tag{3.16}$$



Some comments are in order here. First, the normalization factors in (3.16) are determined
by the integrated probability and thus depend on the bounds one establishes for the
parameters. Since we are discussing here relative probabilities in the parameter space, they
are not relevant at this stage, but they become more important when some parameters
are marginalized. Second, as argued in subsect. 2.1, the logarithmic prior is physically
sensible and is the one that can capture the intuition that fine-tunings are statistically
unlikely. Actually, when plugged in (3.15), the logarithmic prior gives rise to the
fine-tuning penalization $1/\mu^2 \sim 1/c_\mu$. However, the plain logarithmic prior of eq. (3.16)
is clearly too simple, since it cannot be normalized due to low-energy and high-energy
divergences. These are easily cured by taking reasonable upper and lower bounds on the
parameters, e.g. [10 GeV, $M_X$]. In fact, this choice can be refined. From the 1-loop RGEs
of the initial parameters, it is clear that very small values for $m$, $A$, $B$ are not radiatively
stable, due to sizeable contributions proportional to the gaugino mass $M$. Therefore, it is
not very sensible to assume that values of these parameters smaller than, say, $O(10^{-1}M)$
at precisely $M_X$ can have a particular statistical meaning. Thus we can take flat priors
in this region of small values. On the other hand, the experimental lower bounds on the
gluino, charginos and neutralinos imply that $M$ and $\mu$ cannot be smaller than $O(100)$
GeV.
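The refinement just described (flat at small values, logarithmic above, with an upper cut-off) can be sketched for a single parameter as follows. This is only an illustration under stated assumptions: the function names, the threshold `x_flat` (playing the role of the $O(10^{-1}M)$ scale) and the cut-off `x_max` are hypothetical choices, not prescriptions taken from the paper.

```python
import math


def regularized_log_prior(x, x_flat, x_max):
    """Unnormalized 1-d prior: flat below x_flat, 1/x up to x_max, zero elsewhere."""
    if x < 0.0 or x > x_max:
        return 0.0
    # max() makes the density continuous at x = x_flat.
    return 1.0 / max(x, x_flat)


def normalization(x_flat, x_max):
    """Integral of the unnormalized prior over [0, x_max].

    The flat piece contributes x_flat * (1 / x_flat) = 1 and the logarithmic
    piece contributes ln(x_max / x_flat).
    """
    return 1.0 + math.log(x_max / x_flat)


# Hypothetical numbers in GeV: flat below 10, logarithmic up to 1000.
density_low = regularized_log_prior(5.0, 10.0, 1.0e3)
density_high = regularized_log_prior(100.0, 10.0, 1.0e3)
norm = normalization(10.0, 1.0e3)
```

The normalization depends on both bounds, which is the reason why such constants matter once some parameters are marginalized but drop out of relative probabilities.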



In Fig. 2 we show the effective prior defined in (3.3) and computed using eq. (3.8) [with
the full one-loop expressions of eqs. (3.4)-(3.6)] for the two priors discussed after eq. (3.16),
i.e. flat and logarithmically flat. The plots show, up to a constant of proportionality,
the effective priors in the $\{m, M\}$ plane (with constant $\tan\beta$, $A$) for some representative
cases (see footnote 8). We have assumed in the figures that the soft terms are initially given
at the scale of gauge unification, $M_X \sim 10^{16}$ GeV, as essentially happens in scenarios of
gravity-mediated SUSY breaking, but of course our formulas are also applicable to e.g.
gauge-mediated SUSY breaking scenarios. The penalization of large scales is clear for the
logarithmically flat prior, as expected from our discussion. The fact that using a logarithmic
prior penalizes large values of the parameters could seem quite obvious. However, this is
not so clear when one compares the integrated probability that the parameters are within
different ranges of scales. For instance, the logarithmic prior alone would give more probability to



8 The proportionality constant is simply the normalization constant of the initial prior, eq. (3.16),
times the normalization constant of the Yukawa prior. However, these factors play no role in the
examination of the relative probabilities of the points in the parameter space. Obviously, the absolute
values of the left plots cannot be compared to those of the right plots, as they are affected by a different
normalization constant. We prefer not to include these normalization factors, as they depend on the upper
limit assumed for the soft terms, and do not shed any additional light on the relative probabilities inside
the parameter space.

Figure 2: Values of the effective prior, $p_{\rm eff}$, in logarithmic units as defined in eq. (3.3) (up
to a normalization constant), in the $\{m, M\}$ plane for $A = 0$ and $\tan\beta = 3$ (upper plots),
$\tan\beta = 10$ (central plots), $\tan\beta = 30$ (lower plots). The left and right plots correspond
respectively to the two basic choices of priors (flat and logarithmically flat) discussed in
eq. (3.16) and below. See text for further details.

Figure 3: The same as Fig. 2, but in the $\{M, \tan\beta\}$ plane, for $A = 0$, $m = M$.

the [100 TeV, $M_X$] range than to the [100 GeV, 100 TeV] one. However, the presence of
the mentioned fine-tuning factor, $1/\mu^2$, in the effective prior still penalizes the high-energy
regions.
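The comparison of integrated probabilities can be made concrete with a one-dimensional toy. The sketch below assumes a single scale variable $x$ with a logarithmic prior $1/x$, and mimics the fine-tuning factor by an extra $1/x^2$; identifying all soft terms and $\mu$ with one common scale is a simplification made here for illustration only, and the function names are hypothetical.

```python
import math

M_X = 1.0e16               # GeV, unification scale quoted in the text
LOW, MID = 1.0e2, 1.0e5    # 100 GeV and 100 TeV


def log_prior_weight(a, b):
    """Integral of 1/x over [a, b]."""
    return math.log(b / a)


def penalized_weight(a, b):
    """Integral of (1/x) * (1/x**2) over [a, b], i.e. log prior times 1/x^2."""
    return 0.5 * (a ** -2 - b ** -2)


# Probability of the high range relative to the low range.
ratio_log = log_prior_weight(MID, M_X) / log_prior_weight(LOW, MID)   # 11/3
ratio_penalized = penalized_weight(MID, M_X) / penalized_weight(LOW, MID)
```

With the logarithmic prior alone the high range is favoured by a factor 11/3, whereas in this toy the extra $1/x^2$ factor suppresses it by roughly six orders of magnitude relative to the low range.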

Fig. 3 is similar to Fig. 2, but showing now slices in the $\{M, \tan\beta\}$ plane (with the
condition $m = M$). The plots illustrate the $\tan\beta$ dependence of the effective prior, which
can be essentially extracted from the approximate expression (3.15). [Note that, besides
the explicit dependence, eq. (3.15) contains an implicit dependence on $\tan\beta$ through the
$R_\mu$, $B_{\rm low}$ and $y/y_{\rm low}$ factors.] We can appreciate from the plots that the prior probability
decreases with $\tan\beta$.

The effective prior computed and shown in the figures corresponds to the last two
factors of the pdf (3.2). The first factor, i.e. the likelihood, carries the experimental
information (fermion masses, electroweak precision tests, $g-2$ of the muon, dark matter
constraints, etc.). Whatever experimental information (and thus likelihood) we may use,
it will always be weighted by the same effective prior factor shown here.

In this section we have argued so far that the sensible initial choice of independent
parameters of the MSSM is $\{g_i, y_t, m, M, A, B, \mu\}$, while for practical reasons it is most
convenient to work with the set $\{g_i, m_t, m, M, A, \tan\beta, M_Z\}$ (and sign $\mu$). $M_Z$ is eliminated
from the analysis using its extremely sharp likelihood. The effective prior in the new
variables is then given by eqs. (3.3, 3.8), for which we gave explicit approximate expressions
in eqs. (3.14, 3.15).

It is interesting to wonder what would have been the result if one had insisted on
taking directly $m_t$ as an initial (nuisance) variable, so that the transformation (3.1) would
have just involved $\{\mu, B\} \rightarrow \{M_Z, t\}$, as has been done e.g. in ref. [4]. As argued in
subsect. 2.2, it is theoretically bizarre to take $m_t$ as a fundamental variable, instead of $y_t$.
However, one may gain the bonus of almost no sensitivity to the prior in $m_t$, since this is
essentially fixed by the experiment. This is true, but this procedure introduces extremely
counter-intuitive contributions to the Jacobian, as we will see shortly. The new 2-variable
Jacobian is given by



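$$J_2 = \left|\begin{array}{cc} \left.\dfrac{\partial \mu}{\partial M_Z}\right|_{t,m_t} & \left.\dfrac{\partial B}{\partial M_Z}\right|_{t,m_t} \\[3mm] \left.\dfrac{\partial \mu}{\partial t}\right|_{M_Z,m_t} & \left.\dfrac{\partial B}{\partial t}\right|_{M_Z,m_t} \end{array}\right|, \tag{3.17}$$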



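where the subscripts emphasize which variables have to be kept frozen in the partial
derivatives. Now, using the definitions (3.7), it is straightforward to obtain

$$J_2 = \frac{\partial f}{\partial M_Z}\frac{\partial h}{\partial t} + \frac{\partial g}{\partial M_Z}\left(\frac{\partial f}{\partial y}\frac{\partial h}{\partial t} - \frac{\partial f}{\partial t}\frac{\partial h}{\partial y}\right) + \frac{\partial f}{\partial M_Z}\frac{\partial h}{\partial y}\frac{\partial g}{\partial t}. \tag{3.18}$$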



It is amusing that this expression is more complicated than in the 3-variable case, eq. (3.8).
This comes from the fact that the derivatives in (3.17) contain contributions coming from
the dependence of $\mu$ and $B$ on $y$, which is in turn a function of $t$ and $M_Z$, eq. (3.6).
These contributions were cancelled inside the 3-variable Jacobian thanks to the third row
in the matrix of eq. (3.8), but they are not cancelled here and give rise to the second and
third terms in eq. (3.18). Note that the first term in (3.18) is similar to the 3-variable
Jacobian given by eq. (3.8), whose physical significance (including the information about
fine-tuning) was discussed after eq. (3.13). This term goes parametrically as $B/\mu$, and was
the only one quoted in ref. [4]; hence the resemblance of their result to our approximate
expression (3.14), except for the RG and $s_\beta^{-1}$ factors. However, the second term goes
parametrically as $B m^2/(\mu M_Z^2)$, and thus is much more important for large soft terms,
which then become strongly favoured (contrary to the intuitive expectations). Therefore
there is no reason to ignore such a term. In consequence, the expressions used in
ref. [4] are much closer to using $y_t$ as a fundamental variable with a logarithmically flat
prior than to using $m_t$.
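The structure of the two Jacobians can be checked numerically with toy functions. The sketch below assumes the functional dependences $\mu = f(M_Z, y, t)$, $y = g(M_Z, m_t, t)$, $B = h(\mu, y, t)$; the explicit forms of `f`, `g`, `h` are arbitrary smooth stand-ins chosen for illustration, not the MSSM expressions. It compares the full finite-difference determinants with the product $f_{M_Z} h_t\, g_{m_t}$ (3-variable case) and with a three-term expansion of the 2-variable case.

```python
import math


# Toy smooth functions (purely illustrative, NOT the MSSM expressions).
def f(MZ, y, t):      # mu = f(M_Z, y, t)
    return math.sqrt(4.0 + y * y * t + 0.5 * MZ * MZ)


def g(MZ, mt, t):     # y = g(M_Z, m_t, t)
    return mt * math.sqrt(1.0 + t * t) / (t * MZ)


def h(mu, y, t):      # B = h(mu, y, t)
    return t / (1.0 + t * t) * (3.0 + y + 2.0 * mu * mu) / mu


def d(func, args, i, eps=1e-5):
    """Central finite-difference partial derivative of func w.r.t. args[i]."""
    up, dn = list(args), list(args)
    up[i] += eps
    dn[i] -= eps
    return (func(*up) - func(*dn)) / (2.0 * eps)


def mu_y_B(MZ, mt, t):
    """Composite map (M_Z, m_t, t) -> (mu, y, B)."""
    y = g(MZ, mt, t)
    mu = f(MZ, y, t)
    return mu, y, h(mu, y, t)


def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


point = (1.3, 0.9, 2.0)          # toy values of (M_Z, m_t, t)
MZ, mt, t = point
y0 = g(MZ, mt, t)
mu0 = f(MZ, y0, t)

# Full Jacobian of the composite map: row = output (mu, y, B), column = input.
full = [[d(lambda *p: mu_y_B(*p)[k], point, i) for i in range(3)]
        for k in range(3)]
J3_numeric = det3(full)
# 2-variable case: outputs (mu, B), inputs (M_Z, t), with m_t frozen.
J2_numeric = full[0][0] * full[2][2] - full[0][2] * full[2][0]

# Partial derivatives of the individual building blocks.
fM, fy, ft = (d(f, (MZ, y0, t), i) for i in range(3))
gM, gm, gt = (d(g, point, i) for i in range(3))
hmu, hy, ht = (d(h, (mu0, y0, t), i) for i in range(3))

J3_product = fM * ht * gm
J2_three_terms = fM * ht + gM * (fy * ht - ft * hy) + fM * hy * gt
```

The agreement holds for any smooth functions with these dependences, which makes explicit that the extra terms of the 2-variable case come only from the dependence of $y$ on $M_Z$ and $t$.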

Let us finish this section by using the approximate expressions discussed above to
give, as advanced at the end of subsect. 2.1, an approximate expression for the fine-tuning
parameter $c_\mu$. Recall that this parameter was defined as

$$c_\mu = \left|\frac{\partial \ln M_Z^2}{\partial \ln \mu}\right|_{y,B}, \tag{3.19}$$



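where the subscript indicates that the partial derivative must be performed at constant
$y$, $B$. Using eqs. (3.7), $c_\mu$ can be written as

$$c_\mu = \frac{2\mu}{M_Z}\left(\frac{\partial f}{\partial M_Z}\right)^{-1}\left[1 + \frac{\partial f}{\partial t}\,\frac{\partial h}{\partial \mu}\left(\frac{\partial h}{\partial t}\right)^{-1}\right], \tag{3.20}$$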



where the right hand side has to be understood in absolute value. As above, using (3.9)
and (3.10) we obtain explicit approximate expressions for the $f$, $g$, $h$ functions. Then
eq. (3.20) reads

$$c_\mu \simeq \frac{4R_\mu^2\,\mu^2}{M_Z^2}\left|\,1 + \frac{t^2(1+t^2)}{(t^2-1)^3}\;\frac{m_{H_1}^2 - m_{H_2}^2}{\mu_{\rm low}^2}\left(\frac{4t\,\mu_{\rm low}}{(1+t^2)\,B_{\rm low}} - 1\right)\right|. \tag{3.21}$$

Note that the combination $m_{H_1}^2 - m_{H_2}^2$ can be easily written in terms of $B$, $\mu$ using
eqs. (3.4, 3.5).
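As a rough cross-check of this kind of closed form, one can compare it with a direct finite-difference evaluation of $|\partial \ln M_Z^2/\partial \ln\mu|$ at fixed $B$. The sketch below makes several assumptions that should be kept in mind: it uses the standard tree-level minimization conditions, $\mu^2 = (m_{H_1}^2 - m_{H_2}^2 t^2)/(t^2-1) - M_Z^2/2$ and $B\mu = \frac{1}{2}\sin 2\beta\,(m_{H_1}^2 + m_{H_2}^2 + 2\mu^2)$, as stand-ins for eqs. (3.4, 3.5); it switches off the RG running ($R_\mu = 1$, $\mu_{\rm low} = \mu$, $B_{\rm low} = B$); and the function names and the numerical point are illustrative only.

```python
import math


def mz2_and_t(mu, B, mH1sq, mH2sq):
    """(M_Z^2, tan beta) from (mu, B) via the assumed tree-level relations."""
    s2b = 2.0 * B * mu / (mH1sq + mH2sq + 2.0 * mu * mu)   # sin(2 beta)
    t = (1.0 + math.sqrt(1.0 - s2b * s2b)) / s2b            # tan(beta) > 1 branch
    mz2 = 2.0 * ((mH1sq - mH2sq * t * t) / (t * t - 1.0) - mu * mu)
    return mz2, t


def c_mu_numeric(mu, B, mH1sq, mH2sq, rel=1e-6):
    """|d ln M_Z^2 / d ln mu| at fixed B and soft masses, by central differences."""
    up, _ = mz2_and_t(mu * (1.0 + rel), B, mH1sq, mH2sq)
    dn, _ = mz2_and_t(mu * (1.0 - rel), B, mH1sq, mH2sq)
    dlog_mu = math.log1p(rel) - math.log1p(-rel)
    return abs((math.log(up) - math.log(dn)) / dlog_mu)


def c_mu_closed(mu, B, mH1sq, mH2sq):
    """Closed form with R_mu = 1, mu_low = mu and B_low = B."""
    mz2, t = mz2_and_t(mu, B, mH1sq, mH2sq)
    corr = (t * t * (1.0 + t * t) / (t * t - 1.0) ** 3
            * (mH1sq - mH2sq) / (mu * mu)
            * (4.0 * t * mu / ((1.0 + t * t) * B) - 1.0))
    return abs(4.0 * mu * mu / mz2 * (1.0 + corr))


# Toy point (GeV units), built so that tan(beta) = 10 and M_Z = 91.19 GeV.
t0, MZ, mu0, mH1sq = 10.0, 91.19, 500.0, 400.0 ** 2
mH2sq = (mH1sq - (t0 * t0 - 1.0) * (mu0 * mu0 + 0.5 * MZ * MZ)) / (t0 * t0)
B0 = t0 / (1.0 + t0 * t0) * (mH1sq + mH2sq + 2.0 * mu0 * mu0) / mu0

c_fd = c_mu_numeric(mu0, B0, mH1sq, mH2sq)
c_cf = c_mu_closed(mu0, B0, mH1sq, mH2sq)
```

At this toy point the leading piece $4\mu^2/M_Z^2$ dominates, and the $\tan\beta$-dependent correction shifts it by a few per cent.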

4 Conclusions 

The start of LHC has motivated an effort to determine the relative probability of the 
different regions of the MSSM parameter space, taking into account the present (theoret- 
ical and experimental) wisdom about the model. These attempts are often called "LHC 
forecasts" [1-4,6-8]. The central equation to extract this valuable information is the 
fundamental Bayesian relation 

$$p(s, m, M, A, B, \mu\,|\,{\rm data}) \propto \mathcal{L}(s, m, M, A, B, \mu)\; p(s, m, M, A, B, \mu), \tag{4.1}$$

which gives this probability in terms of the usual experimental likelihood, $\mathcal{L}$, and the prior
$p(s, m, M, A, B, \mu)$, i.e. the "theoretical" probability density assigned a priori to points
in the space spanned by the MSSM parameters $\{m, M, A, B, \mu\}$ and the SM-like ones ($s$).

Since the present experimental data are not powerful enough to select a small region of
the MSSM parameter space, the choice of a judicious prior becomes most relevant. Indeed,
ignoring this amounts to an implicit choice for the prior (which is not always sensible).
On the other hand, it is common lore that the parameters of the MSSM, $\{m, M, A, B, \mu\}$,
should not be far from the electroweak scale in order to avoid unnatural fine-tunings to
obtain the correct scale of the electroweak breaking. Previous studies have attempted to
incorporate this reasonable intuition into the Bayesian approach, by choosing a prior that
included (more or less explicitly) a conventional measure of the fine-tuning, typically the
Barbieri-Giudice parameter, $c$, defined in eq. (2.2).

However, though reasonable, these kinds of proposals are rather arbitrary, as the very
measure of the fine-tuning is. On the other hand, since the naturalness arguments are
deep down statistical arguments, one might expect that an effective penalization of
fine-tunings should arise from the Bayesian analysis itself. One of the main results of this
paper has been to show that this is really so: using the fact that the likelihood associated
to the experimental $M_Z$ is essentially a Dirac delta, $\sim \delta(M_Z - M_Z^{\rm exp})$, one can easily
marginalize the $\mu$-parameter (i.e. integrate the density of probability in this variable).
Then one gets an effective prior for the remaining parameters

$$p_{\rm eff}(s, m, M, A, B) = 2\,\frac{\mu_0}{M_Z\,c_\mu}\; p(s, m, M, A, B, \mu_0), \tag{4.2}$$

which exhibits the fine-tuning penalization. ($\mu_0$ is the value of $\mu$ that reproduces the
experimental $M_Z$ for the given values of $\{s, m, M, A, B\}$.) Of course this effective prior has
to be combined with the experimental likelihood, except the part associated to the $Z$ mass.
The initial prior, $p(s, m, M, A, B, \mu)$, can be taken as flat or (preferably) logarithmically
flat, as usual. We find it very satisfactory that precisely the usual parameter to quantify
the degree of fine-tuning emerges in the Bayesian approach "spontaneously", not upon
subjective assumptions, especially taking into account that there has been much discussion
in the literature about its significance and suitability. We have completed this analysis
by giving an explicit and quite accurate expression for $c_\mu$, see eq. (3.21).

Our second result concerns the treatment of the Yukawa couplings. In previous
Bayesian analyses the Yukawas were essentially taken as needed to reproduce the
experimental fermion masses, within uncertainties. However, unlike in the pure SM, in the
MSSM the Yukawa couplings are not in one-to-one correspondence with the quark and
lepton masses: they depend also on the value of $\tan\beta$, which is a derived quantity that takes
different values at different points of the MSSM parameter space. This means that two
viable MSSM models (with the same fermion masses) will have in general very different
values of the Yukawa couplings, and thus the theoretical prior, $p(y)$, will play a relevant
and non-ignorable role in evaluating their relative probability. Any Bayesian analysis of
the MSSM amounts to an explicit or implicit assumption about the prior in the Yukawa
couplings. We have made explicit the dependence of the results on such a prior and shown
that the easiest and usual practice of taking the Yukawas "as required" approximately
corresponds to taking logarithmically flat priors in the Yukawa couplings, which on the
other hand is not an unreasonable choice at all.

Finally we have repeated this analysis using a more efficient set of variables to scan
the MSSM parameter space. Besides trading $\mu$ for $M_Z$ and the Yukawa couplings (in
particular the top one) for the fermion masses, it is known that trading $B$ for $\tan\beta$ is
highly advantageous. Following similar steps one can arrive at an effective prior in the
new parameters:

$$p_{\rm eff}(g_i, m_t, m, M, A, \tan\beta) = J|_{\mu=\mu_0}\; p(g_i, y_t, m, M, A, B, \mu=\mu_0), \tag{4.3}$$

where $J$ is the Jacobian of the transformation

$$\{\mu, y_t, B\} \rightarrow \{M_Z, m_t, t\}, \qquad t \equiv \tan\beta. \tag{4.4}$$

($M_Z$ does not appear in (4.3) since it is marginalized as explained
above.) Note that still the initial choice of independent parameters is $\{y_t, m, M, A, B, \mu\}$
(on which the initial priors are defined). It is the change of variables plus the
marginalization of $M_Z$ that leads to the above effective prior. We have calculated $J$ both numerically
and analytically (in an approximate but quite accurate fashion). The relevant formulas
are eqs. (3.8) and (3.14). The last expression is very handy and leads to the effective
prior given in eq. (3.15). Whatever experimental information (and thus likelihood) one
may use, it will always be weighted by the same effective prior factor calculated (and
shown in plots for illustrative cases) here.

We have also discussed the results in comparison with other approaches in the
literature, arguing that the present one is conceptually more satisfactory.